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Abstract 

It is known that graphs on n vertices with minimum degree at least 3 have spanning 
trees with at least n/A + 2 leaves and that this can be improved to (n + 4)/3 for cubic 
graphs without the diamond K4 — e as a. subgraph. We generalize the second result by 
proving that every graph with minimum degree at least 3, without diamonds and certain 
subgraphs called blossoms, has a spanning tree with at least (n+4)/3 leaves, and generalize 
this further by allowing vertices of lower degree. We show that it is necessary to exclude 
blossoms in order to obtain a bound of the form n/3 + c. 

We use the new bound to obtain a simple FPT algorithm, which decides in 0{m) + 
0*{Q.lb^) time whether a graph of size m has a spanning tree with at least k leaves. This 
improves the best known time complexity for Max Leaf Spanning Tree. 

1 Introduction 

In this paper we study spanning trees with many leaves. We prove a new extremal result, 
and apply it to obtain a fast FPT algorithm for the related decision problem MaxLeaf. 

We first introduce the extremal problem and explain our contribution. Throughout this 
paper G is assumed to be a simple and connected graph on n > 2 vertices. Other graphs 
may be multi-graphs, disconnected, or a Ki. The minimum vertex degree of G is denoted 
by 5{G). Vertices of degree 1 are called leaves. 

Linial and Sturtevant [12] and Kleitman and West [11] showed that every graph G with 
5{G) > 3 has a spanning tree with at least n/4 + 2 leaves, and that this bound is best possible. 
The paper [11] also improves on this bound for graphs of higher minimum degree. 

The examples showing that n/4 + 2 is best possible for graphs of minimum degree 3 all 
consist of cubic diamonds connected in a cyclic manner. A diamond is the graph K4 minus 
one edge, and an induced diamond subgraph of a graph G is a cubic diamond if its four 
vertices all have degree 3 in G, see Figure 1 (a). 

Since these examples are very restricted it is natural to ask if better bounds can be 
obtained when diamonds are forbidden as subgraphs. This question was answered by Griggs, 
Kleitman and Shastri [10] for cubic graphs, which are graphs where every vertex has degree 3. 
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Figure 1: A cubic diamond (a), a 2- necklace (b), and a 2-blossom (c). 



They show that a cubic graph G without diamonds always admits a spanning tree with at 
least n/3 + 4/3 leaves. For minimum degree 3 the following bound is proved in [2]. A graph 
G with d{G) > 3, without cubic diamonds, contains a spanning tree with at least 2n/7+ 12/7 
leaves. Both bounds are best possible for their respective classes. 

In [2] it is conjectured that the following statement holds, which would improve the bound 
2n/7+ 12/7 with only a minor extra restriction, and would also generalize the result for cubic 
graphs from [10]: Every graph G with 5{G) > 3 and without 2-necklaces contains a spanning 
tree with at least n/3 + 4/3 leaves. Informally speaking, a 2-necklace is a concatenation of 
A; > 1 diamonds with only two outgoing edges, sec Figure 1 (b). 

In Section 2 we disprove this conjecture by constructing graphs with 6{G) = 3 without 2- 
necklaces, which do not admit spanning trees with more than 4n/13+2 leaves. On the positive 
side, we prove that the statement is true after only excluding one more very specific structure, 
called a 2-blossom, sec Figure 1 (c). Precise definitions of 2-necklaces and 2-blossoms are given 
in Section 2. So we prove that graphs G with 6{G) > 3 without 2-necklaces or 2-blossoms 
have a spanning tree with at least n/3 + 4/3 leaves. In fact we generalize this statement even 
further by removing any restriction on the minimum degree. The resulting statement is given 
in Theorem 1, which is our main result. 

Let V>3{G) denote the set of vertices in G with degree at least 3 and n>3(G) its cardinality. 
Let £{T) be the number of leaves of a graph T. 

Theorem 1. Let G be a simple, connected graph on at least two vertices which contains 
neither 2-necklaces nor 2-blossoms. Then, G has a spanning tree T with 



Section 2 shows that Theorem 1 is also best possible for non-cubic graphs. Without proof 
we remark that also other results can be generalized in a similar way, e.g. it is not hard 
to extend the proof in [11] to prove that all graphs G have a spanning tree with at least 
(n — n2(G))/4 -|- c leaves, where n2{G) denotes the number of vertices of degree 2 in G. 

Our proof of Theorem 1 is constructive and can be turned into a polynomial time algorithm 
for the construction of a spanning tree. The main technical contribution of this paper is that 
we prove this generalization of the statement in [10], and improvement of the statement 
in [2], without a proof as lengthy as the proofs in these two papers. This is made possible by 
extending the techniques and proofs from [10]. In Section 3 we argue that the long case study 
in [10] actually proves a strong new lemma, which we use as an important step in the proof 
of Theorem 1. We share the opinion expressed in [10] that a shorter proof of the bound for 
cubic graphs might not exist. Therefore using that result in order to prove the more general 
statement seems appropriate. 

We now explain the consequences that Theorem 1 has for FPT algorithms (short for fixed 
parameter tractable) for the following decision problem. 



V>3(G) 
n>3(G) 
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Max-Lcavcs Spanning Tree (MaxLeaf): 
INSTANCE: A graph G and integer k. 

QUESTION: Does G have a spanning tree T with t{T) > k? 

It is known that MaxLeaf is AAP-complete, see [9]. When choosing k as a parameter, 
an algorithm for this problem is called an FPT algorithm if its complexity is bounded by FPT 
f{k)g{n), where g{n) is a polynomial. See [8] and [5] for introductions to FPT algorithms, algorithm 
f{k) is called the parameter function of the algorithm. Usually, g{n) will turn out to be parameter 
a low degree polynomial, thus to assess the speed of the algorithm it is mainly important function 
to consider the growth rate of f{k). Though since MaxLeaf is A/'P-complete, f{k) will 
most likely always be exponential. Bodlaender [1] constructed the first FPT algorithm for 
MaxLeaf with a parameter function of roughly (IT/c^)!. Since then, considerable effort 
has been put in finding faster FPT algorithms for this problem, see e.g. [4, 7, 3, 6, 2]. The 
papers [3, 6, 2] also establish a strong connection between extremal graph-theoretic results and 
fast FPT algorithms. In [3], the bound of n/4-|-2 from [11] mentioned above is used to find an 
FPT algorithm with parameter function C 0*(9.49*^). Here the O* notation ignores O* 

polynomial factors. With the same techniques the bound of 2n/7+12/7 mentioned above is is 
turned into the so far fastest algorithm, with a parameter function in 0*{{^'^^) C 0*(8.12'^') 
([2]). Similarly Theorem 1 yields a new FPT algorithm for MaxLeaf, presented in Section 4. 

Theorem 2. There exists an FPT algorithm for MaxLeaf with time complexity 0{m) + 
0*(6.75'^), where m denotes the size of the input graph and k the desired number of leaves. 

This algorithm is the fastest FPT algorithm for MaxLeaf at the moment, both opti- 
mizing the dependency on the input size and the parameter function. It simplifies the ideas 
introduced by Bonsma, Brueggemann and Woeginger [3] and is also significantly simpler than 
the other recent fast FPT algorithms. Hardly any preprocessing of the input graph is needed, 
since Theorem 1 is already formulated for a very broad graph class. We end in Section 5 with 
a discussion of possible extensions and further consequences of Theorem 1. 



2 Obstructions for Spanning Trees with Many Leaves 



2.1 Diamond Necklaces, Blossoms and Flowers 

As mentioned in the introduction 2-necklaces have been identified as an obstruction for the 
existence of spanning trees with n/3 -|- c leaves in graphs with minimum degree 3, see [11] 
and [2]. In this section we show that they are not the only such obstruction. We start by 
precisely defining 2-necklaces and 2-blossoms. 

The degree of a vertex u in a graph G is denoted by dciv) and by d(v) if ambiguities can 
be excluded. A vertex u of a subgraph H oi G with dniv) < dc{v) is called a terminal of H. terminal 

Definition 1 (2-Necklace). The graph K4 minus one edge is called a diamond and denoted diamond 

by A^i. The degree 3 vertices are the inner vertices of the diamond. inner 

For k>2 the diamond necklace is obtained from the graph Nk-i and a vertex disjoint vertices 

Ni by identifying a degree 2 vertex of A^i with a degree 2 vertex of N^-i- Thus, has again diamond 

two degree 2 vertices, which are denoted by ci and C2- necklace 

An Nk subgraph of G is a 2-necklace if it only has ci and C2 as terminals, which both have 2-necklace 
degree 3 in G. See Figures 1 (a) and (b). If G contains an Ni this way, this Ni subgraph is 

also called a cubic diamond of G. cubic 

diamond 
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Diamond necklaces will also be called necklaces for short. In the course of studying the 
leafy tree problem we found that the subgraphs defined next are also an obstacle for the 

existence of spanning trees with n/3 + c leaves. 

Definition 2 (2-Blossom). The graph B on seven vertices shown in Figure 2 (a) is the 
blossom graph. A blossom subgraph i? of G is a 2-blossom if ci and C2 are its only terminals, 
and they both have degree 3 in G, see Figure 2 (b). 




Figure 2: A blossom graph (a), a 2-blossom (b), and a flower (c). 

If G contains a 2-blossom B, only the vertex b has degree 4 in G, and the remaining 
vertices of B have degree 3 in G. The two outgoing edges of a 2-necklace respectively a 
2-blossom may in fact be the same edge, in that case G is just a 2-necklace respectively 2- 
blossom plus one additional edge. The next proposition shows how many leaves can be gained 
within a blossom. 




Figure 3: Spanning trees restricted to a blossom 

Proposition 1. Let G be a graph with a blossom subgraph B that has ci and C2 as its 
only terminals. A spanning tree T of G exists with maximum num,ber of leaves, such that 
E{T) n E{B) has one of the forms in Figure 3. 

Proof. Consider a spanning tree T of G with maximum number of leaves. We may distinguish 
the following two cases for E{T) n E{B): either this edge set induces a tree, or it induces a 
forest with two components, one containing ci and the other containing C2. 

In the first case, at most three non-terminal vertices of B can be leaves of T, since a path 
from ci to 02 contains at least two internal vertices. In addition, if one of ci and C2 is a leaf 
of T, then T can be seen to have at most two non-terminal vertices of B among its leaves. 
Since ci and C2 together form a vertex cut of G, one of them is not a leaf in T. It follows 
that replacing E(T) n E{B) by the edge set in Figure 3 (a) does not decrease the number of 
leaves. Since this edge set forms again a spanning tree of B, the resulting graph is a spanning 
tree of G. 

Now suppose E{T) n E{B) forms two components. At most four non-terminal vertices 
of B can be leaves of T. If one of ci and C2 is a leaf of T, then T can have at most three 
non-terminal vertices of B among its leaves. One of Ci and C2 is not a leaf in T, and thus it 
follows again that replacing E{T) n E{B) by the edge set in Figure 3 (b) does not decrease 
the number of leaves, while maintaining a spanning tree of G. □ 

We now present a family of graphs with minimum degree 3 which do not contain diamond 
necklaces but do not have spanning trees with n/3 + c leaves. 
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Definition 3 (Flower, Flowerbed). The flower graph is the graph on thirteen vertices shown flower graph 
in Figure 2 (c). 

The flowerbed Ri of length i consists of i flowers, connected in a cyclic manner, see Figure 4. flowerbed 
Formally, Ri is constructed by starting with i disjoint flowers, and adding i edges in such a 
way that the graph is connected and has minimum degree 3. 

Figure 4 shows the flowerbed R5. The solid edges show a spanning tree with 4n/13 + 2 
leaves, which we will show to be optimal. 




Figure 4: The flowerbed R5. The solid edges show a tree with maximum number of leaves. 

Proposition 2. The flowerbed Ri has no spanning tree with more than 4n/13 + 2 leaves. 

Proof. Let F be a flower in Ri containing a blossom B, where ci and C2 are the terminals of 
B. The neighbor of Cj not in B is called fj {j = 1,2). We will argue that no spanning tree 
T of Ri has more than four leaves among V{B) U {/i, /2}. Proposition 1 shows that without 
loss of generality we may assume that E(T) n E{B) has one of the two forms in Figure 3. If 
it has the first form, then one of /i and /2 may be a leaf of T, but not both since together 
they form a vertex cut of Ri. If E{T) n E{B) has the second form, then /i and /2 are both 
cut vertices of T, so neither can be a leaf. 

Now we consider the other vertices of Ri that may be leaves in T. Let C be the cycle in 
Ri that joins the i flowers, that is, the facial cycle of length 2i in Figure 4. Suppose v G V{C) 
is a leaf of T. In G — v, all vertices of C except one are cut vertices, so T may have at most 
one other vertex of C as a leaf. It follows that at most two vertices from C can be leaves in 
a spanning tree T oi Ri. The remaining vertices of Ri that we have not considered yet (two 
for every flower) are cut vertices of Ri and therefore not leaves of T. 

Summarizing, any spanning tree has at most two leaves in C, and at most four additional 
leaves for every flower. The statement follows. □ 

2.2 Tightness of the Bound 

The bound n/3+4/3 for cubic graphs is shown to be tight in [10]. Infinitely many examples are 
given with no more than n/3 + 2 leaves. On the other hand it is shown that there exists only 
one graph that ensures that the additive term 4/3 can not be increased: the 3-dimensional 
cube Qs, which has eight vertices and only admits four leaves. 

Because the bound is best possible for cubic graphs, our bound is best possible as well. 
But also graphs with arbitrarily many vertices of higher and lower degree can be constructed 
which do not admit more than n>3/3 + 2 leaves. Figure 5 (a) shows such an example with 
many degree 2 and degree 4 vertices (which is closely related to one of the examples from [10]). 
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The reason that the additive term cannot be increased to 2 is again only one example: 
Figure 5 (b) shows a graph on n = 7 vertices that only admits 4 = n>3/3 + 5/3 leaves. This 
graph will be called G-j in the remainder. This graph is in fact a blossom plus two edges; 
deleting any edge between two degree 4 vertices yields a 2-blossom.An additive constant of 
5/3 is possible for non-cubic graphs, but we will not prove this statement in this version of 
the paper. 




Figure 5: Non-cubic extremal graphs. 



3 Proof of the Main Theorem 

This section is devoted to the proof of the main theorem. We first sketch the proof and give 
an overview of the different ingredients that will be used. First we introduce a number of 
reduction rules in Section 3.1. These reduction rules are applied to the graph G until an 
irreducible graph G' is obtained. These rules have the property that if the main theorem 
holds for every component of G', it also holds for G. In the next sections, we therefore only 
have to consider irreducible graphs. In Section 3.2 we argue that the proofs from [10] for cubic 
graphs in fact show that if a non-spanning forest F of G" is given, that contains all vertices 
of G' of degree at least 4, then one of the trees of F can be extended to a larger tree while 
maintaining the proper leaf ratio. Finally, in Section 3.3 we show how to obtain this starting 
forest F that covers all high degree vertices, while having enough leaves. We use these tools 
in Section 3.4 to prove Theorem 1. 

3.1 Reducible Structures 

In this section we introduce a number of reduction rules. The proof of the main theorem 
relies on locally extending a forest until it becomes spanning while guaranteeing a certain 
number of leaves for every intermediate forest. The reductions help to delay the treatment of 
some sTibstructures which cannot be readily handled during the extension process and they 
also simplify the case study in the main proof. 

Ignoring rules that disconnect the graph, the main idea behind the reduction rules is as 
follows. A graph G is reduced to a graph G' with n>3(G) — n>^{G') = k, such that every 
spanning tree of G' can be turned into a spanning tree of G with at least k/Z additional 
leaves. This preserves the desired leaf ratio. Lemma 3 states this idea more precisely. 

We now give the necessary definitions. A vertex with degree at most two will be called 
a goober. We adopt this notion from [10], although there it is defined differently. In [10] goober 
goobers are those vertices of degree at most two resulting from a reduction rule. We observe 
that nowhere in the proofs the extra structural information which this definition may provide 
is actually used. Hence goobers may simply be defined as we do here. The important gain 
is that now we do not have to require graphs to have minimum degree 3 in our statements. 
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One important convention is that goobers are always defined with respect to the whole graph, 
that is when wc consider a subgraph H of G, a vertex v of if is a goober if dG(v) < 2. In our 
figures, white vertices indicate goobers. A high degree vertex is a vertex of degree at least 4. high degree 

We first repeat the seven reduction rules defined in [10], and then introduce five new rules vertex 
which are designed to handle structures containing higher degree vertices. While the first 
seven rules are defined in [10] for graphs with maximum degree 3 we define them for arbitrary 
graphs, but the vertices on which they act must have the same degrees as in the original 
definition. 

The seven reduction rules from [10] consist of graph operations on certain structures, and 
conditions on when they may be applied. Figure 6 shows the operations. The black vertices 
all have degree 3, and goobers are shown as white vertices. Dashed edges are present in the 
resulting graph if and only if they exist in the original graph. The numbers above the arrows 
indicate the decrease in n>3, and the numbers below the arrows indicate the number of leaves 
that can be gained in a spanning tree when reversing the reduction. 




Figure 6: The seven low degree reduction rules 

The following restrictions are imposed on the application of these rules (see Section 3 of [10]): 

• Reductions (1), (3), (4) and (5) may not be applied if the two outgoing edges from 
the left side, or the two outgoing edges from the right side, share a non-goober end 
vertex. (An outgoing edge from the left and an outgoing edge from the right may share 
a non-goober end vertex.) 

• Reduction (7) may not be applied if any pair of outgoing edges shares an end vertex. 

In other words, a rule may not be applied if it would introduce multi-edges incident with 
non-goobers, or if it would introduce a diamond. These seven reduction rules will be called 

the low- degree reduction rules. 

We define an invariant that exhibits the properties which should be maintained while 
doing graph reductions. 

Definition 4 (Invariant). A graph H is said to satisfy the invariant if: 

• if is connected, or every component of H contains a goober, and 



low- degree 

reduction 

rules 

satisfy the 
invariant 



7 



• every component of H is either simple or it is a + e, and 



H contains neither 2-necklaces nor 2-blossoms. 



The reduction rules are applied in the induction step in the proof of our main theorem; this 
invariant states the important properties that should be preserved in the reduction process. 

Lemma 1. Let G' be obtained from G by the application of a low-degree reduction rule. If G 
satisfies the invariant then so does G' . 

Proof. Note that the reduction rules (l)-(6) only introduce goobers as new vertices and the 
only new edges are incident to these goobers. Furthermore all other vertex degrees remain 
unchanged. Hence these reductions cannot introduce 2-necklaces or 2-blossoms. Rule (7) 
cannot introduce a 2-blossom since a 2-blossom cannot share a vertex with a triangle induced 
by three vertices of degree three. This is not true for 2-necklaces, but if rule (7) introduces 
a 2-necklace, two of the outgoing edges share an end vertex, contradicting the condition for 
applying rule (7). 

For all of the rules that may disconnect the graph, it is clear that both new components 
will contain a goober. So the only way in which one of the reductions might violate the 
invariant is by introducing multiple edges. But using the imposed restrictions it can be seen 
that multiple edges can only be introduced between two goobers, giving a K2 + e. □ 

We now introduce five new reduction rules, which we call the high- degree reduction rules. 
Each rule again consists of a graph operation and conditions on the applicability. Figure 7 
shows the graph operations for the five rules. 

The encircled vertices are the terminals, which may have further incidences, unlike the 
other vertices. None of the vertices in the figures may coincide, but there are no restrictions 
on outgoing edges sharing end vertices. The numbers above the arrows indicate the decrease 
in n>3, and the numbers below the arrows indicate the number of leaves that can be gained 
in a spanning tree when reversing the reduction. Since (R4) must disconnect a component, 
this notion is not relevant for (R4); this rule will be treated separately below. 



high- degree 

reduction 

rules 



(Rl) 



(R3) 




(R2) 



(R4) 



u 




u 



w 



u 



u 



\> V 



(R5) 



u V 



\U V 



Figure 7: The high-degree reduction rules. 

The following restrictions are imposed on the applicability of these operations to a graph G. 
First, none of the reduction rules may be applied if it introduces a new 2-necklaceor 2-blossom. 
In addition, the following rule-specific restrictions are imposed. Let cc{H) denote the number 
of connected components of a graph H. 



cc{H) 



(Rl) dciv) > 4. 
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(R2) daiu) > 4 and daiv) > 4. 

(R3) cc{G') = cc(G), the edge uw is not in G, and in addition dQ'{v) > 3, or dciw) > 3, or 
both. 

(R4) cc{G') > cc{G), that is G' is not connected. 

(R5) daiu) > 4, dciv) > 4, and may not be a bridge. 

A bridge is an edge whose deletion increases the number of components. In the remainder, we bridge 
will call a reduction rule admissible if it can be applied without violating one of the imposed admissible 
conditions. In particular the condition that no 2-necklaces or 2-blossoms are introduced will 
be important. Since the high-degree reductions rules are defined such that no 2-necklaces or 
2-blossoms can be introduced, it is easy to see that the following lemma holds: 

Lemma 2. Let G' he obtained from G by the application of a high-degree reduction rule. If 
G satisfies the invariant then so does G' . 

Observe that in particular, (R5) seems counterproductive when the goal is to find spanning 
trees with many leaves, but it is useful to keep the case analysis in the proof of Lemma 6 
simple. 

Definition 5 (Reducible). A graph G is reducible if one of the low-degree or high-degree 
reduction rules can be applied, and irreducible otherwise. 

Griggs et al. [10] call a graph irreducible if none of the low-degree reduction rules can 
be applied. Clearly, a graph that is irreducible according to our definition is also irreducible 
according to their definition, so we may apply their lemmas for irreducible graphs also using 
the above definition of irreducibility. 

Note that irreducible graphs satisfying the invariant are simple because of reduction 
rule (2). Components with only one vertex will be called trivial components in the sequel. 

We now show that we can reverse all of these reduction rules while maintaining spanning 
trees for every component, having the proper number of leaves. For the low-degree reduction 
rules, this lemma was implicitly proved in [10]. So for the detailed tree reconstructions we 
refer to [10], but we do repeat the main idea behind the proof here. 

Lemma 3 (Reconstruction Lemma). Let G' be the result of applying a reduction rule to a 
connected graph G and a > 0. If G' has k non-trivial components Ci, . . . ,Ck, which all have 
a spanning tree with at least n>s{Ci)/3 + a leaves, then G has a spanning tree T with 

i{T) > n>3(G)/3 + afe - 2{k - 1). 

Note that the reduction rules create at most two components, that is A; < 2. We use this 
lemma with a = 4/3 if G' is connected, and with a = 2 otherwise. 

Proof. Suppose the applied rule was a low-degree reduction rule. Note that cc{G') is either 
1 or 2. If G' is connected, then its spanning tree can be turned into a spanning tree of G 
with (n>3(G') — n>3(G'))/3 more leaves. To prove this, it is shown in Section 3 of [10] for 
every rule how to adapt the tree of G' for G {tree reconstructions). This already proves the 
statement if cc{G') = 1. If cc{G') = 2, then applying the same tree reconstructions yields a 
spanning forest of G consisting of two trees, with again (ra>3(G) — n>3(G'))/3 more leaves in 
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total. If both components of G' are non-trivial (A; = 2), then the two resulting trees of G can 
be connected to one spanning tree T by adding one edge, losing at most two leaves. In that 
case we have: 

t{T) > n>3(G')/3 + 2a + (n>3(G) - n>3(G'))/3 - 2 = n>3(G)/3 + ak - 2(k - 1). 

If exactly one of the two components is trivial {k = 1) then the applied rule must be Rule 
(2) or (3). In this case, it can be checked that after the tree reconstruction, one edge can be 
added without decreasing the number of leaves; one leaf is lost but an isolated vertex becomes 
a leaf. Then we have: 

^{T) > n>3(G")/3 + a + (n>3(G) - n>3(G'))/3 = n>3(G)/3 + ak - 2{k - 1). 

If both components of G' are trivial (k = 0), then the applied rule was (2), and G = K2, 
for which the statement holds: —2{k — 1) = 2. This proves the lemma when a low-degree 
reduction rule is applied. 




Figure 8: Spanning tree constructions when reversing the new reduction rules 

Now we consider the high-degree reduction rules. Note that rules (Rl), (R2), (R3) and 
(R5) do not increase the number of components, so A; = 1. So for (R5) we do not have to 
change the spanning tree of G' . For (Rl), (R2) and (R3), Figure 8 shows how to gain at least 
one additional leaf in every case, which suffices since each of these rules decreases n>3 by at 
most three. Here it is essential that (R3) is admissible only if it creates at most one goober. 
Dashed edges in the figure are present on the right if and only if they are present on the left. 
Symmetric cases arc omitted in the figure. Note that none of the terminals of the operations 
can lose leaf status, except w in the second reconstruction for (R3). This is compensated by 
gaining two new leaves here. So in every case enough leaves are gained to maintain the ratio. 

Recall that (R4) is only admissible if it disconnects G into two components, which will be 
non-trivial, so k = 2. Figure 8 shows how to construct a spanning tree for G from the two 
spanning trees for the components, without decreasing the total number of leaves. Hence the 
number of leaves of the resulting tree is at least 

n>3(G')/3 + 2a = ra>3(G)/3 - 5/3 + 2a > n>3(G)/3 + 2a - 2 = n>3(G)/3 + ak - 2{k - 1) 

This proves the lemma for all reduction rules. □ 

The following property of irreducible graphs substantially simplifies subsequent proofs. 
Here G7 denotes the graph from Figure 5 (b). 
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Lemma 4 (Edge Deletion). Let G he an irreducible graph not equal to G-j with adjacent 
vertices u and v. If d[u) = d[v) = 4, then uv is a bridge, or one of u,v becomes an inner 
vertex of a cubic diamond upon deletion of the edge uv. 

Proof. Suppose for the sake of contradiction a non-bridge edge uv exists, between vertices of 
degree 4, such that none of u, v becomes an inner vertex of a diamond upon deletion of uv. 

Since G is irreducible, no reduction rule is admissible. Clearly, this must mean that a 
2-necklace or 2-blossom is introduced when uv is deleted, that is when (R5) is applied to uv. 
In either case, wc will derive a contradiction to the irreducibility of G. 

Claim 1 The graph G — uv docs not contain a 2-necklace A'^. 

Suppose for the sake of contradiction that G — uv does contain a 2-necklace N. Consider A'^ 
as a subgraph of G (so uv is counted towards the degrees of u and v). 

We first treat the case that N consists of at least two diamonds. If one of the diamonds in 
N contains three vertices of degree 3, we can use rule (Rl), see Figure 9 (a). So now we may 
assume that one diamond on the end of the necklace contains u as one of the three vertices 
not shared with the next diamond, and the diamond on the other end of the necklace contains 

V this way. 

If u is a vertex with degree 2 in N, then (R2) can be applied, see Figure 9 (b). This does 
not introduce a 2-nccklace since the degree 4 vertex v is part of N on the other end. Because 

V is part of a diamond, this can also not introduce a 2-blossom. 

In the remaining case, both u and v are internal vertices of their respective diamonds. 
Now it is admissible to apply (R5) to a different edge incident with u, see Figure 9 (c) , where 
the dashed edge is the deleted one. 

This does not introduce a 2-necklace or 2-blossom: u becomes part of a triangle that is 
induced by degree 3 vertices, for which all outgoing edges have different end vertices. Such 
a triangle cannot be part of a 2-blossom or 2-necklace. The other end vertex of the deleted 
edge is still part of a diamond after deletion, and thus is not part of a 2-blossom. It is not 
part of a 2-necklace since v is in this part of the necklace. 

This concludes the case where N consists of at least two diamonds. 




Figure 9: Reductions when a long 2-necklace is created. 

Now suppose consists of a single diamond. If u is an inner vertex of this diamond, then 

V cannot be part of the same diamond since we are dealing with simple graphs. This is then 
the case we excluded by assumption, see Figure 10 (a). So without loss of generality u is one 
of the vertices that have degree 2 in the diamond. 

Now rule (Rl) or (R2) is admissible, depending on whether v is also in the diamond, see 
Figures 10 (b) and (c). This does not introduce a 2-blossom or 2-necklace, since in the case in 
Figure 10 (b), a triangle containing a goober is introduced, and in the case in Figure 10 (c), 

V has degree 4 and a goober at distance 2. Note that also no parallel edges are introduced: 
in the case in Figure 10 (c) the edges leaving the diamond are distinct, that is deleting uv 
does not give a K4, since in that case uv would have been a bridge. This shows that it is 
admissible to apply either (Rl) or (R2), which contradicts the irreducibility of G. A 
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(a) (b) (c) 

(R2) 1^ ^ (Rl) V 



Figure 10: Reductions when a cubic diamond is created. 



Claim 2 The graph G — uv docs not contain a 2-blossom B. 

For B we use the vertex labels from Figure 11 (a). The degree 4 vertex of B is labeled 
b, its terminals are called c-vertices, and the remaining four vertices are called its a-vertices. 
Now consider i? as a subgraph of G (so uv is counted towards the vertex degrees). Since 
doiu) = 4 = daiv) neither of them is equal to b, since b has degree 4 even after the deletion 
of uv. 




04 03 



Figure 11: The blossom B after deleting uv. 



If u is an a- vertex, say without loss of generality u = ai, then it is admissible to delete 
the edge connecting u to b instead, see Figure 11 (b). We argue that this does not introduce 
a 2-blossom or 2-necklace. Figure 12 shows the possible results of deleting ub in more detail, 
depending on the position of v. First suppose 17/04. After deleting ub, b becomes part of 




Figure 12: Possible results of deleting ub. 

a triangle that does not share a vertex with another triangle, since we assumed v 7^ 04. It 
follows that b is neither part of a 2-necklace, nor of a 2-blossom. The vertex u may be part of 
a triangle (when = C2 or when v is not in B but adjacent to ci), but such a triangle is not 
part of a diamond, hence u is not part of a 2-necklace. Finally we argue that u is not part of 
a 2-blossom: since b is not part of a 2-blossom, its neighbor 02 is not part of a 2-blossom B' 
unless it is a terminal of B'. In that case it is not part of a triangle, but its neighbor C2 is, 
which is impossible. Hence 02 is not part of a 2-blossom. Then if u is part of a 2-blossom B', 
it must be a terminal of B', and thus not part of a triangle, but its neighbor ci must be part 
of a triangle. This is again not possible. This concludes the proof that ii v 04, deleting ub 
is an admissible application of (R5). 

Now we need to consider the case case that v = and u = ai, see Figure 12 (e). Deleting 
ub does not introduce a 2-necklace, but there is exactly one way in which it may introduce a 
2-blossom, which has v as its central degree 4 vertex. Figure 13 shows this case, the bold edges 
indicate the new blossom. But now it can be seen that the original graph, which includes ub, 
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V 



Figure 13: Deleting uh yields a blossom. 

is exactly G-j, a contradiction with oTir assumption. We conclude that if u is an a-vcrtcx and 
G ^ G-j^ in every case the edge uh can be deleted by an admissible application of (R5). 

It remains to consider the case that u is a c-vertex. Then, (R3) could be used, see 
Figure 14 (a) and (b) . The bold edges indicate the structure reduced by (R3) . If in case (a) 
a 2-necklace is introduced, v would be an inner vertex of one of its diamonds, but that is not 
possible since d[v) = 4. In case (b) no 2-necklacc can be introduced, since v is part of at most 
one triangle. In neither case a 2-blossom is introduced. A 




Figure 14: More reductions if a 2-blossom is created. 

We have thus derived a contradiction to the irreducibility of the graph for all cases where 
deleting uv would not be an admissible application of (R5), which proves the lemma. □ 



3.2 Using the Result for Cubic Graphs to Prove Tree Extendibihty 

Using our observation that the results in [10] hold when goobers are simply defined as vertices 
with degree at most 2, we may restate Theorem 3 from [10] as follows. 

Theorem 3. Every irreducible graph G of maximum degree exactly 3 and without cubic 
diamonds has a spanning tree with at least n>3(G)/3 + a leaves, where a = 4/3 if G is cubic 
and a = 2 otherwise. 

We give a short overview of the proof of this statement, as it appears in [10]. For a 
subgraph T of G, in addition to ^(T) the following values are considered. By naiT) we nciT) 
denote the number of non-goober vertices of G that are in V{T). By ld{T) we denote the (.d{T) 
number of dead leaves of T, that is leaves of T, which have no neighbor in V{G) \ V{T). ^^q^iI leaves 

The value of T is defined as 2. 5£(T) -|-0.5£d(r) — nG(T). First it is shown in [10] that a tree value 
T with value at least 4 can always be found, and even one with value at least 5.5 if H contains 
at least one goober. Next, it is shown that every non-spanning tree T can be extended, that 
is a tree supergraph T' of T can be found with value at least the value of T. This part of 
the proof consists of a rather involved case study. The extensions can be repeated until a 
spanning tree is found, in which case all leaves are dead. Rewriting the value expression, and 
rounding up the start value then yields Theorem 3. 

We observe that nowhere in the case study that proves extendibihty any information about 
the current tree T is used; loosely speaking, only information about the part of H 'outside' of 
T is used. In particular, the fact that T is connected is never used in the proof, and neither 
are upper bounds on degrees of vertices already included in T. 
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We will now define the leaf potential of a subgraph which generalizes the above definition 
of the value of a tree, and we will formalize the notions 'extendible' and 'outside'. Using these 
new notions a useful lemma can be formulated, which we conclude is proved, but not stated 
in [10]. 

Definition 6 (Leaf- Potential) . The leaf-potential of a subgraph F C G is Vg{F)= 2.5£(F) + 
QMd{F)-nG{F)-Qcc{F). 

If ambiguities are excluded in the context we simply write V{F). 



leaf- 
potential 
Vg{F) 



Definition 7 (Extendible). Let F be a subgraph of a graph G. Then F is called extendible extendible 
if there exists an F' with F C F' C G and Vg{F') > Vg{F). 

Above we already informally mentioned the subgraph of G 'outside' a subgraph F C G. 
Considering the proof in [10], we see that this graph may formally be defined as an edge 
induced graph as follows. 



Definition 8 (Graph Outside F). Let F be a non-spanning subgraph of G. The subgraph of 
G outside of F is F^= G[{uv G E{G) : u V{F)]\. The boundary of F is V{F) n V{F^) 

Note that no edges between two vertices that are both in V{F) are included in F*-^. If G 
is clear from the context we call F'^ the graph outside F. Expressed using these definitions, 
the case study in [10] yields the following lemma. 

Lemma 5 (Extension Lemma) . Let G be a connected irreducible graph, and let F d G such 
that F^ has maximum degree 3 and contains no cubic diamonds. Then F is extendible. 



subgraph of 
G outside of 
F 
pC 

boundary 
graph 
outside F 



3.3 Growing Trees around High Degree Vertices 

The purpose of this section is to prove Lemma 6, which grows trees around high degree 
vertices and yields a graph satisfying the assumptions of Lemma 5. Lemma 6 is the core of 
our proof of Theorem 1. 

We denote the set of vertices of G which are not in F by V{F) = V{G)\ V{F). The 
neighborhood N{v) of a vertex v is the set of all vertices adjacent to v, and the closed 
neighborhood of v is N[v] = N{v) U {v}. Expanding a vertex v G V{G), which is an operation Expanding 
on a subgraph F of G, yields a new subgraph with vertex set V{F) U N[v], and edge set 
E{F) U {uv : u G N[v]\V{F)}. So all newly added neighbors of v become leaves, and v 
may lose leaf status. The number of components increases by one if and only ii v ^ V{F). 
Expanding a list of vertices means expanding the vertices in the given order. 

We adopt the short-hand notation A{x,y,z):= 2.5y -\- 0.5z — x from [10] to express the A{x,y,z) 
change in Vq when extending a graph F to a new graph F'. Let A£ denote i{F') — £{F), and 
define Aid and Aug analogously. So when F and F' have the same number of components, the 
extension is valid if and only if A(AnG, Ai, Ai^) > 0, and if a new component is introduced 
we need A(AnG, Ai, Aid) > 6. 

For the sake of simpler notation, instead of writing e.g. A(AnG, Ai, Aid) > ^(4, 3, 1) = 4, 
we will simply write A(4, 3, 1) = 4. Hence the three parameter values need not be exactly 
Aug, Ai and Aid but reflect the worst case scenario. That is, the change in the leaf potential 
that we prove is to be read as a lower bound for the actual change. 
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Lemma 6 (Start Lemma). Let G be an irreducible graph not equal to Gj and F a (possibly 
empty) subgraph of G, such that F^"' contains at least one vertex of degree at least 4j o-n-d 
contains neither 2-necklaces or 2-blossoms. Then, F is extendible. 

Proof. First suppose F is not the empty graph. If there is a vertex v on the boundary of F 
which is not a leaf, then F' can be obtained by expanding v. There is no leaf lost since v 
was not a leaf, and the newly added vertices are leaves. So the augmentation inequality is 
satisfied: A{k,k,0) > 0. Hence, we may assume in the remainder that only leaves of F have 
neighbors in V{F), or in other words, all vertices on the boundary of F are leaves of F. 



(AIL 



(A2) 



A(0,0,0) = 

t 



A{i + 2,i,j)>0iii,j > 1 




A(i + 3,i,0) >'0 if i > 2 




(A3) 



A{i + 1, i, 0) > 0.5 if z > 1 A(l, 0, i) > if ? > 2 
(A5)^ 




A{i + 2,1,0) > 1 if i > 2 



(A7), 




t®' 

A(i + 3,i,l) > 0.5 if i > 2 



Figure 15: Simple augmentations of an existing subgraph 

The next step is the attempt to augment F using the operations (A1)-(A7), see Figure 15. 
Conventions for this figure are that encircled vertices belong to V{F), solid edges show the 
expansion and vertex degrees shown are to be understood as lower bounds. Dead leaves arc 
marked with a cross. All of the expansions in the figure extend F without creating a new 
connected component, and satisfy A(AnG, A^, A^,^) > 0. Thus the resulting graph F' is an 
extension as claimed in the lemma. Together these augmentation rules yield the following 
claim. 



Claim The subgraph F is extendible, if a vertex in V{F) has a goober neighbor in V{F) 
or at least two neighbors in V{F), or if there is a high-degree vertex v G V{F) at distance at 
most two from F. 



If a goober from V(F) is adjacent to F then (Al) can be applied. If a vertex in V{F) has 
at least two neighbors in V{F), (A2) can be applied. So from now on we will assume every 
vertex in V{F) has at most one neighbor in V{F), and this neighbor is not a goober. If a 
high-degree vertex in V{F) is adjacent to a vertex in V{F), (A3), (A4) or (A5) can be applied. 
The creation of the dead leaves in (A3) and (A4) follows from the fact that (A2) cannot be 
applied anymore. If a high-degree vertex in V{F) has distance two from a vertex in V{F), 
(A6) or (A7) can be applied. A 

The rest of the proof will handle the more complicated case when F is the empty graph, or 
the only high-degree vertices in V{F) are at a larger distance from V{F). We then introduce a 
new component for F. This is more complicated because adding a further component comes at 
a certain cost, more precisely we need that the new component satisfies A(AnG, A£, Ala) > 6. 
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The rest of the proof is divided into three more claims. The first one handles the easiest 
cases, and the second one handles all cases except those where every degree 4 vertex is the 
common vertex of two edge-disjoint triangles. This final case is then taken care of in the third 
claim. Throughout the proof we assume, sometimes implicitly, that none of the situations 
that have been handled earlier can occur. 

Claim 1 Let v G V{F), d{v) > 4, and w G N{v). In the following four situations F is 
extendible: d{v) > 5, or d{v) = 4 and is a goober, or d{v) = d{w) = 4, or d{v) = 4 and 

N[w] C N[v]. 

First note that no vertex in N[v] or N[w] is part of F by Claim 0. If d{v) > 5, expanding 

V yields A(fc + 0) > 6.5, since k > 5. For d{v) = 4 and w a goober, expanding v gives 
A(4,4,0) = 6. 

Now suppose d{v) = d{w) = 4. If vw is a bridge, expanding v and w yields A(8, 6, 0) = 7. 
Otherwise, Lemma 4 shows that either v or w, say v, becomes the inner vertex of a cubic 
diamond upon deletion of the edge vw. (Note that we assumed G / Gj, so Lemma 4 
may be applied.) Thus, either w has two neighbors not in N[v] and expanding v,w yields 
A(7, 5, 1) = 6 (see Figure 16 (a)), or w shares two neighbors with v in which case expanding 

V yields A(5,4, 3) = 6.5 (see Figure 16 (b)). 

So now we may assume that all neighbors of v have degree 3. If N[w] C N[v] then either 
the unique vertex u G A''[f]\A'^[Ti;] has two neighbors not in N[v], in which case expanding 
u,v gives A(7, 5,1) = 6, see Figure 16 (c), or there is another vertex x G N[v] — w with 
N[x] C N[v], and v is expanded to obtain A(5,4, 2) = 6, see Figure 16 (d). A 




Figure 16: Figures for Claim 1. 

Summarizing, we may now assume that V{F) contains no vertices of degree at least 5, and 
if it contains a vertex v of degree 4, all neighbors of v have degree 3 and have either one or 
two neighbors not in N[v]. 



Claim 2 If V{F) contains a vertex v with d{v) = 4 and a vertex w G N{v) which has two 
neighbors a,b ^ N[v], then F is extendible. 

We denote the other three neighbors of v by x, y, z. If one of a, b, x, y, z has all of its neighbors 
in {a, b} U N[v], we have A(7, 5, 1) = 6 by expanding v, w, see Figure 17 (a). 

If a or 5 is a goober we obtain A(6, 5, 0) > 6.5, see Figure 17 (b). If a or 6 is adjacent to 
a vertex c G V{F), then expanding v, w will make c a dead leaf and yields A(7, 5, 1) > 6, see 
Figure 17 (c). If one of a,b,x,y,z has at least two neighbors not in N[v] U {a,b}, we obtain 
A(9, 6, 0) = 6 by expanding v,w and this vertex, see Figure 17 (d). 

Hence we may assume that a, b, x, y, z each have exactly one neighbor outside N[v]U{a, b}. 
This neighbor is not part of F. Since they all have degree at least 3, these five vertices must 
induce three edges. This implies that one of a, b has degree 4 since we already know that 
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Figure 17: Figures for Claim 2. 



X, y, z have degree 3. We may assume without loss of generality that d{a) = 4 and d{b) = 3. 
We distinguish two cases depending on whether a is adjacent to b or not. The statement 'a 
is adjacent to 6' is denoted by o ~ 6. We denote the neighbor of x outside of N[v] U {a, 6} by a ■ 
x', and similarly a',b',y', z' are defined. 

Case 1. a is adjacent to x and y while b is adjacent to z. 

Consider expanding v,x,z. All vertices in {a,b,w,y,x',z'} arc adjacent to at least one 
of v,x,z, thus we have A(9, 6, 1) = 6.5 unless x' = z' , see Figure 18 (a). By an analogous 
argument with y in the place of x we may now assume that x' = z' = y'. Then, expanding 
v,x yields A(7, 5, 1) = 6, since y becomes a dead leaf, see Figure 18 (b). 



(a) 



X, . 

1 z 


X 














h. 






Figure 18: Figures for Claim 2, Case 1 



Case 2. a is adjacent to b and x while y is adjacent to z. 

If x' 7^ a', expanding a,x,v yields A(9, 6, 1) = 6.5, see Figure 19 (a), so we may assume 
that x' = a' =: c, and this creates a situation symmetric in b and c. By Claim 1 we have that 
6 9^ c. Now first suppose b' ~ c. Then expanding b', c, x, v yields A(10, 6, 3) = 6.5, provided b' 
has a neighbor d other than y,z, see Figure 19 (b). Note that d G V{F) is not possible since 
augmentation (A6) could have been applied instead. 




Figure 19: Figures for Claim 2, Case 2 

If N{b') = {b,c,y}, then (R3) is admissible, see Figure 19 (c). Since b' becomes a goober 
this cannot introduce a 2-necklace. Hence it must be that b' ~ y,z and the graph has 
A (9, 5, 5) = 6, see Figure 20 (a). This concludes the cases with b' ~ c. 

The case b' ^ c can be excluded because then (R3) would be admissible, see Figure 20 (b). 
Note that this cannot create a 2-necklace involving y^z since then (R2) would have been 
admissible. 
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Figure 20: Additional figures for Claim 2, Case 2. 



This concludes the proof of Claim 2. A 

Summarizing Claims 0, 1, and 2, we may now assume that all neighbors of a degree 4 
vertex v G V{F) have degree 3, and have exactly one neighbor not in N[v]. In other words, 
V is the common vertex of two edge-disjoint triangles, see Figure 21. 




Figure 21: The bow tie subgraph 

Claim 3 If the graph outside F contains a vertex v with d{v) = 4 such that all its neighbors 
have degree 3 and one neighbor outside N[v], then F is extendible. 

Wc denote the neighbors of v by p',q',r',s' and assume that p' ~ q' and r' ~ s' . The 
neighbor of p' outside N[v] is denoted by p and similarly q, r, s are defined, see Figure 21. We 
split the proof of the claim into three cases. 

Case 1. p = q 

If p has degree 3, we can apply (Rl), see Figure 22 (a). So now without loss of generality 
p has degree 4. Then by Claim 2, p is also part of two edge-disjoint triangles. So if p = r 
then also p = s. In that case we can expand p', v to obtain A(6, 4, 4) = 6, sec Figure 22 (b). 
So now p ^ r, p ^ s. Consider applying (R2) to the diamond consisting of p,p',q',v, see 
Figure 22 (c). If this introduces a 2-necklace, (Rl) could have been applied to the diamond 
on the other end of this necklace. It cannot introduce a 2-blossom since the triangles of a 
2-blossom contain a degree 4 vertex. 



t 




Figure 22: Figures for Claim 3, Case 1 

Case 2. p = r 

Note that d{p) = 3 by Claim 2. Also q ^ s since the graph does not contain 2-blossoms 
and the case that d{q = s) = 4 is again excluded by Claim 2. Thus, (R3) is admissible, see 
Figure 23. 
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Figure 23: The figure for Claim 3, Case 2 



Case 3. p,q,r,s pairwise different. 

In this case either (R4) or (R3) is admissible: if v is a cut vertex, (R4) may be used (it 
increases the number of components). Otherwise, we may assume without loss of generality 
that p and s arc in same connected component of G — and (R3) can be applied without 
disconnecting the graph. See Figures 24 (a) and (b). A 
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Figure 24: Figures for Claim 3, Case 3 



This concludes all possible cases: whenever the subgraph of G outside of F contains a 
high degree vertex, we have shown that G is either reducible, or F is extendible. □ 



3.4 The Proof of the Main Result 

This section is devoted to combining the tools developed in the last three subsection in order 
to prove Theorem 1, which we repeat here for convenience. 

Theorem 1. Let G be a simple, connected graph on at least two vertices which contains 
neither 2-necklaces nor 2-blossoms. Then, G has a spanning tree T with 



^(T) >n>3(G)/3 + 



4/3 if 5{G) > 3 
2 if6{G)<2. 



Proof. We prove the statement by induction. For our induction hypothesis we actually prove 
that the above statement holds for every connected graph which satisfies the invariant. Then 
the statement follows for simple graphs. 

First suppose G is irreducible. If G has maximum degree exactly 3, Theorem 1 follows 
immediately from Theorem 3. If G has maximum degree at most 2, G has a spanning tree 
with at least two leaves (note that we assumed that G is not a K^), which suffices. If G = G7, 
then a spanning tree with 4 = n>3(G)/3 + 5/3 leaves can be obtained. So we may now assume 
that G contains at least one high degree vertex, and is not equal to G7. 

We start with an empty subgraph F of G, which has Vg{F) = 0. The Start Lemma 
(Lemma 6) shows that, as long as there is at least one high degree vertex not in F, we can 
extend F while maintaining Vg{F) > 0. When all high degree vertices are included in F, 
the Extension Lemma (Lemma 5) can be applied iteratively, until a spanning subgraph F' is 
obtained with Vg{F') > 0. Without loss of generality, we may assume that F' is a forest; 
cycles can be broken without decreasing the number of leaves. Since all leaves of a spanning 
subgraph arc dead we deduce 

< VaiF') = U{F') - n>3(G) - 6cc(F') =^ i{F') > n>3(G)/3 + 2cc{F'). 
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We can now add cc{F') — 1 edges to F' to obtain a spanning tree, losing at most 2{cc{F') — 1) 
leaves, so the resulting tree has at least n>3(G)/3 + 2 leaves. 

It remains to consider the case that G is reducible (the induction step). Some reduction 
rule is admissible, and the reduced graph G' again satisfies the invariant, by Lemmas 1 and 2. 

First suppose G' is connected. None of the reduction rules remove goobers, so if 5{G) < 2, 
then 6{G') < 2, and by induction G' has a spanning tree with at least n>3(G')/3 + 2 leaves. 
Lemma 3 then shows that G has a spanning tree with at least n>3(G)/3 + 2 leaves. Similarly, 
if S{G) > 3 then it follows that G has a spanning tree with at least n>3(G)/3 + 4/3 leaves. 
Now suppose the reduction rule yields a disconnected graph G'. Then, by the definition of the 
reduction rules, every resulting component has a goober. So by induction, every non-trivial 
component G of G' has a spanning tree with at least n>3(C)/3 + 2 leaves. Thus Lemma 3 
implies that G has a spanning tree with at least ra>3(G)/3 + 2 leaves. □ 



4 A fast FPT Algorithm for MaxLeaf 

In this section we present a fast and relatively simple FPT algorithm for MaxLeaf, which 
uses Theorem 1 as an essential ingredient. The other two ingredients arc a short preprocessing 
step, consisting of two reduction rules, and an enumerative procedure, which is similar to the 
one introduced in [3], and also applied in [2]. 

We start by presenting the two reduction rules that constitute the preprocessing phase. 
The reduction rules for the FPT algorithm are different from the rules used in Section 3. It 
is important that they yield an equivalent instance of the decision problem MaxLeaf, but 
there are no conditions on the ratio between the decrease in vertices and in possible leaves. 

Recall that in 2-necklaces and 2-blossoms both terminals have degree 3 in G. The rules we 
introduce now also reduce diamonds and blossoms whose two terminals have arbitrary degree. 
However the two terminals of the subgraph must still be the two vertices that have degree 2 
in the diamond necklace or blossom itself. Such a subgraph of G will be called a 2-terminal 
diamond respectively a 2-terminal blossom. Rule (Fl) in Figure 25, which resembles rule (R2), 2-terminal 
reduces 2-terminal diamonds. Since the 2-necklace consists of k 2-terminal diamonds it is diamond 
reduced as well by rule (Fl). Rule (F2) in Figure 25 reduces 2-terminal blossoms. The next 2-terminal 
lemma proves the correctness of these rules. blossom 




Figure 25: Two reduction rules for an instance (G, k) of MaxLeaf. 



Lemma 7. Let G' be the result of applying reduction (Fl) or (F2) to G. Then (G' , k — 1) is 
a YES-instance for MaxLeaf if and only if {G, k) is a YES-instance for MaxLeaf. 

Proof. First consider the case where the applied rule was (Fl), reducing a 2-terminal diamond 
D ofG with terminals u and v. Consider a spanning tree T of G with at least k leaves. Observe 
that in T, we can always replace the set of edges E(T) n E{D) by one of the two sets shown 
on the right in Figure 26 (a), or a symmetric set, without decreasing the number of leaves. 
Then by replacing it by the corresponding structure on the left, we obtain a spanning tree 
T' of G' with at least k — 1 leaves. Note here that the terminals u and v remain leaves in 
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T' if they are leaves in T. Similarly, any spanning tree of G' has one of the forms shown on 
the left when restricted to the two edges resulting from the reduction. Replacing it with the 
corresponding structure on the right shows that if {G\ A: — 1) is a YES-instance, (G, k) is as 
well. Figure 26 (b) can be used to prove the statement for rule (F2) analogously. For this it is 
useful to note that Proposition 1 shows that we may assume that T restricted to the blossom 
has one of the forms on the right of Figure 26 (b) . □ 



Throughout this section we will denote the set of leaves of a graph G by L{G). We now 
explain how to obtain a graph S{G) from a graph G by suppressing vertices. Suppressing a 
vertex u of degree 2 means deleting u and adding an edge between the two neighbors of n, 
or a loop if both edges incident with u end in the same vertex. We allow this operation to 
introduce parallel edges and loops, so the degrees of non-suppressed vertices are maintained. 
If l^>3(G')l = 0) that is G is a path or cycle, then S{G) is the empty graph. If |1^>3(G)| > 
then S{G) is obtained from G by suppressing all degree 2 vertices. So V{S) = L(G)uy>3(G), 
and G is a subdivision of S{G). Hence loops and non-loop edges of S{G) correspond to cycles 
and paths of G respectively. Let uv be a non-loop edge of S (G) where the corresponding path 
Puv in G has i internal vertices. We define a cost function c on the non-loop edges of S{G) 
which assigns cost c{uv)= min{i,2} to uv. Thus c{uv) is the maximum possible number of 
leaves that a spanning tree of G can have among the internal vertices of Puv Now we are 
ready to present the FPT algorithm in Algorithm 1. 

Algorithm 1 An FPT algorithm for MaxLeaf 
INPUT: a MaxLeaf instance (G,fe). 

1) while G has a 2-terminal diamond or 2-terminal blossom subgraph do 

G :=the result of applying (Fl) or (F2) to G 
k:=k-l 
end w^hile 

2) if n>3(G) > 3/c or |L(G)| > A; or A; < 2 then return(YES) endif 

3) construct S{G) and c 

4) for ah L C V>^{G) with |L| < A; do 

if G has a spanning tree T with L C L{T) and \L\ + \L{T)\V>2,{G)\ > k then 

return(YES) 
endif 
endfor 

5) return(NO) 



In the following proofs, we will use the fact that for any L C V{G), a spanning tree T of 
G with L C L{T) exists if and only if G — L is connected and V{G)\L is a dominating set. 



L{G) 
S{G) 

Suppressing 



(a) 





Figure 26: Tree reconstructions for (Fl) and (F2). 
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The decision in Step 4 can be made in polynomial time in the size of S{G). The essential 
step is to solve a minimum weight spanning tree problem on S(G) — L, using edge costs c. 
Lemma 8 contains the details. 

Lemma 8. Let (G, k) he a MaxLeaf instance for which S{G) and c are non-empty and 
known. For any L C V>3{G), deciding whether G has a spanning tree T with L C L{T) and 
\L\ + \L{T)\V>3{G)\ > k can be done in time polynomial in the size of S{G). 

Proof. Let S = S{G). A spanning tree T of G with L C L{T) exists if and only if V{G)\L 
is a connected, dominating set of G. This is the case if and only if V{S)\L is a connected, 
dominating set of S and there is no edge uv G E{S) with u,v e L and c{uv) > 1. These 
properties can be checked in time polynomial in the size of S. 

Now suppose at least one spanning tree Tq of G with L{Tq) C L exists. We show how to 
construct such a spanning tree T that maximizes \L{T)\V>z{G)\. This process is illustrated 
in Figure 27. White vertices indicate vertices in L. 




1 1 1 



S{G) T' Ts 

Figure 27: Constructing a tree T with L C L{T) using S{G). 

First let T' be a minimum weight spanning tree oi S — L, with respect to the cost function 
c. (This tree can be found in polynomial time.) A spanning tree Ts of S is obtained from T' 
by connecting every w G L to T' by an edge e which has minimum cost c(e). So L C L(Ts). 

A spanning tree T of the subdivision G of 5 can be obtained from Ts in the following way. 
All edges from paths of G that correspond to edges in Ts are in T. At this stage T need not be 
spanning. The inner vertices of a path of G corresponding to an edge uv G E{S)\E{Ts) 
with u £ L,v ^ L are connected to T such that u remains a leaf. The inner vertices of a 
path Puv with uv G E{S)\E{Ts) and u,v ^ L can be connected to T such that c{uv) of them 
become leaves of T. Finally, vertices of cycles of G that correspond to loops of S are connected 
to T such that two of them become leaves. Note that an edge uv G E{S)\E{Ts) with u,v e L 
must correspond to a path P^^ in G with no inner vertex since G — L is connected. At this 
point, every vertex of G is connected to T, without introducing cycles, hence T is a spanning 
tree. 

It can be verified that T was constructed such that L(r)\V>3(G) is maximized, under the 
condition that L C L{T). For this it is essential that T' was chosen to be a minimal spanning 
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tree of 5 — L. We omit the formal proof of this fact here, noting that it is similar to the proof 
given in [3] and [2]. 

Observe that T does not actually have to be constructed for the decision. Thus only S (G) 
needs to be considered, and the statement follows. □ 

Lemma 9. Algorithm 1 returns YES if and only if its input {G, k) is a YES-instance for 
MaxLeaf. 

Proof. Lemma 7 shows that it suffices to prove the statement for the reduced instance (G, k). 
Note that the reduced instance (G, k) is again simple, connected and non-trivial and does 
not contain 2-terminal diamonds or 2-terminal blossoms and thus also no 2-necklaces or 2- 
blossoms. So if n>3(G) > 3fc, then {G,k) is a YES-instance by Theorem L The correctness 
of the other cases in which the algorithm returns YES is easily checked. 

Suppose that (G, k) is a YES-instance. We show that the algorithm indeed returns YES. If 
V>3(G) = 0, then G is a path or cycle, so (G, k) being a YES-instance implies k <2, and YES 
is returned in Step 2. Suppose V>3(G) / 0, Step 2 did not return YES, and T is a spanning 
tree of G with at least k leaves. If \L{T) n F>3(G)| > k, then some set L C L{T) n V>s{G) 
with \L\ = k is considered in the algorithm. Clearly also a spanning tree T' exists with 
L C L(T'), so the algorithm returns YES in Step 4. On the other hand, if L = L(T) n V>3{G) 
has fewer than k elements, then L itself will be considered in Algorithm 1, and in addition 
\L\ + |L(r)\VS3(G)| = |L(r)| > k. The algorithm will then return YES in Step 4. □ 

Lemma 9 proves the correctness of Algorithm 1, the claimed time complexity will be 
proved next. Together this proves Theorem 2. We repeat the statement for convenience. 

Theorem 2. There exists an FPT algorithm for MaxLeaf with time complexity 0{m) + 
0*(6.75'^), where m denotes the size of the input graph and k the desired number of leaves. 

Proof. It only remains to prove the complexity bound. 

The first three steps can be done in linear time by building the proper data structures. 
For this it is essential that the degree of non-terminal vertices of 2-terminal diamonds and 
blossoms is bounded by a constant. We give more details now. 

Assume that G is represented by doubly linked adjacency lists. It is possible to detect 
2-terminal diamonds and 2-terminal blossoms because all of their non-terminal vertices have 
degree at most 4. For every vertex u of degree at most 4, we can store the position of u in 
the adjacency lists of its neighbors. To do this for every such u, all adjacency lists of the 
graph need to be scanned only once. This information now makes it possible to apply (Fl) 
and (F2) in constant time for each diamond and blossom respectively. 

Step 2 can obviously be done in linear time. For Step 3 we switch to a representation using 
arrays for the edges, which contain the labels of the two end vertices and the edge weights. 
We also store the vertex degrees, and the labels of the incident edges for vertices of degree 2. 
This representation allows us to do the following operations in constant time: suppressing a 
degree 2 vertex, calculating the resulting edge weight, and updating the representation. 

Thus Steps 1-3 can be performed in linear time and it only remains to consider the 
complexity of Step 4. Since the reductions in Step 1 do not increase the number of vertices 
or the value of fc, we may assume that n and k are the number of vertices and the parameter 
of the reduced instance, as it is after Step 1. 

Step 4 of the algorithm is only executed when |V>3(G)| < 3A; and |i^(G)| < k. Furthermore 
V{S{G)) = L{G) U y>3(G), so every iteration of the for-loop of Step 4 takes time polynomial 
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in k (Lemma 8). This for-loop is executed once for every subset L C V^^^G) with \L\ < k. 
Using 1V>,3(G)| < 3k, the number of such sets can be verified to be C>(fc(^^)). Using Stirhng's 
approximation x^e ^\/27rx, we obtain 

This concludes the proof. □ 

We remark that we did not optimize the polynomial factor suppressed by the O* notation, 
but it can be seen to be a practical, low degree polynomial. 



\k ) i2k)\k\ \ 



5 Conclusions 

We conclude with some remarks about possible improvements. Theorem 1 can be strength- 
ened at the cost of lengthier proofs. An extended version of this paper will show that 
ra>3(G)/3 + 2 leaves can be obtained whenever G is not cubic, and not equal to G7. It 
can also be shown that in order to obtain this bound, 2-blossoms do not have to be excluded; 
it suffices to only exclude larger structures like the flower in Figure 2. This way a bound of 
4n>3(G)/13 + c can be proved when only 2-necklaces are excluded. 

Besides optimizing the parameter function of FPT algorithms, another goal is to find 
better kernelizations. For MaxLeaf, a kernelization can be defined as a preprocessing method 
that reduces the input to an instance (G, k) which is either a YES-instance, or has < 
f{k) for some function f{k). The current best kernelization for MaxLeaf has f{k) = 3.75k, 
see [6]. Since our goal was to avoid preprocessing as much as possible, our method does not 
give a kernelization by itself. But for instance our approach can be combined with three 
simple reduction rules from [3] and [6] to remove all leaves and adjacent degree 2 vertices. 
This yields a 7k kernelization: a reduced NO-instance has less than 3k vertices of degree at 
least 3 by Theorem 1, and less than 4k vertices of degree 2. 
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